# Image Classification

Medai Resnet50 Brain
MIT
ResNet-50 is a deep residual network developed by Microsoft Research, widely used for image classification tasks.
Image Classification
aryan-anand
31
1
Cat Dog Root Me
An image classification model built with PyTorch and HuggingPics, capable of accurately distinguishing between pictures of cats and dogs.
Image Classification TensorBoard
danihdms
21
1
Plant Identification Vit
Apache-2.0
A plant identification model fine-tuned from Google's Vision Transformer (ViT) architecture, achieving 80.96% accuracy on the evaluation set.
Image Classification Transformers
marwaALzaabi
37
1
Utkface Race Classifications
Apache-2.0
This model is a fine-tuned version of microsoft/resnet-50 on an unknown dataset, primarily used for image classification tasks, achieving an accuracy of 84.86% on the evaluation set.
Image Classification Transformers
raffaelsiregar
202
1
Kat Tiny Patch16 224.vitft
Apache-2.0
KAT is a novel vision model that replaces the traditional Transformer's channel mixer with Grouped Rational Kolmogorov-Arnold Networks (GR-KAN), trained on the ImageNet-1k dataset.
Image Classification
adamdad
293
1
Font Identifier
Apache-2.0
A font recognition model fine-tuned from ResNet-18, achieving 78.1% accuracy on the test set.
Image Classification Transformers
ariadnak
44
2
Font Identifier
MIT
A fine-tuned ResNet-18 model for font recognition, capable of identifying 48 standard fonts with a test accuracy of 96.33%.
Image Classification Transformers English
gaborcselle
1,292
17
Birds Classifier EfficientNetB2
Apache-2.0
A bird image classifier fine-tuned from EfficientNet-B2, capable of recognizing 525 bird species with up to 99% accuracy.
Image Classification Transformers
dennisjooo
4,320
20
Resnet18 Catdog Classifier
Apache-2.0
A fine-tuned cat-dog image classification model based on ResNet-18, trained on the Kaggle Cats and Dogs dataset with an accuracy of 99.29%
Image Classification Transformers English
hilmansw
216
1
Organoids Prova Organoid
Apache-2.0
An image classification model fine-tuned from Google's ViT-base-patch16-224 on an image folder dataset, achieving 85.76% accuracy on the evaluation set.
Image Classification Transformers
gcicceri
25
1
Cola001
Image classification model generated by HuggingPics, capable of identifying different dog breeds
Image Classification Transformers
GiaKhanh
29
0
Pvt Tiny 224
Apache-2.0
Pyramid Vision Transformer (PVT) is a vision model based on transformer architecture, specifically designed for image classification tasks.
Image Classification Transformers
Xrenya
25
0
Fun
Apache-2.0
A vision model fine-tuned from google/vit-base-patch16-224 for image classification tasks.
Image Classification Transformers
tcvrishank
16
0
Vit Base Letter
Apache-2.0
An image classification model fine-tuned from Google's ViT base model on a letter recognition dataset, achieving 98.81% accuracy.
Image Classification Transformers English
pittawat
93
2
Face Discriminator
Apache-2.0
A face classification model fine-tuned from Microsoft ResNet-50, achieving 99.84% accuracy on the validation set.
Image Classification Transformers
petrznel
23
0
Microsoft Swin Tiny Patch4 Window7 224 Ov
This is the OpenVINO version converted from the microsoft/swin-tiny-patch4-window7-224 model, designed to accelerate image classification inference.
Image Classification Transformers English
helenai
508
1
Doge
Doge is an image classification model generated by HuggingPics, specifically designed to recognize Doge-related images.
Image Classification Transformers
Johnnyboiiii
16
0
Swin Tiny Patch4 Window7 224 Isl Finetuned
Apache-2.0
A vision model fine-tuned from microsoft/swin-tiny-patch4-window7-224, achieving 100% accuracy on the evaluation set.
Image Classification Transformers
hazardous
17
0
Fl Image Category Multi Label
Apache-2.0
An image classification model fine-tuned from Google's ViT model on the fl_image_category_ds dataset, achieving 66.22% accuracy.
Image Classification Transformers
StephenSKelley
17
1
Vit Artworkclassifier
Apache-2.0
Art style classification model based on ViT architecture, capable of identifying the art style category of input images
Image Classification Transformers
oschamp
41
2
Fl Image Category
Apache-2.0
An image classification model fine-tuned from microsoft/resnet-18 on the fl_image_category_ds dataset.
Image Classification Transformers
StephenSKelley
29
0
Vit Model
A ViT model fine-tuned on the preprocessed 1024 configuration dataset for image classification tasks
Image Classification Transformers
mm-ai
19
0
Vit Base Patch16 224 Finetuned Algae Wirs
Apache-2.0
An image classification model fine-tuned from Google's ViT model on an algae dataset, primarily used for algae image classification.
Image Classification Transformers
samitizerxu
20
0
Resnet 50 4 32
Apache-2.0
An image classification model fine-tuned from microsoft/resnet-50, achieving 64.1% accuracy on the evaluation set.
Image Classification Transformers
Celal11
26
0
Poke Model
Gpl-3.0
A vision classification model fine-tuned from google/vit-base-patch16-224 to recognize first-generation Pokémon.
Image Classification Transformers
torresflo
23
1
Bald Or Not
A simple image classification model based on PyTorch and HuggingPics, used to determine whether a person in an image is bald.
Image Classification Transformers
mvaloatto
28
3
Yolo V8 Fog Or Smog Classification
An image classification model based on YOLOv8 for identifying fog and smog scenes.
Image Classification TensorBoard
uisikdag
23
0
Genderage2
Apache-2.0
Vision Transformer model based on ViT architecture for gender and age classification tasks
Image Classification Transformers
ivensamdh
263
3
Beit Base Patch16 224 Pt22k Ft22k Finetuned FER2013 7e 05 Finetuned SFEW 7e 05
Apache-2.0
An image classification model based on the BEiT architecture, fine-tuned on the FER2013 dataset for facial expression recognition
Image Classification Transformers
lixiqi
18
0
Beit Base Patch16 224 Pt22k Ft22k Finetuned FER2013CKPlus
Apache-2.0
This model is an image classification model based on the BEiT architecture, fine-tuned on the FER2013CKPlus dataset for facial expression recognition tasks.
Image Classification Transformers
Celal11
19
0
Efficientformer L3 300
Apache-2.0
EfficientFormer-L3 is a lightweight vision Transformer model developed by Snap Research, optimized for mobile devices to achieve low latency while maintaining high performance.
Image Classification English
snap-research
279
2
Swin Small Finetuned Cifar100
Apache-2.0
A small model based on the Swin Transformer architecture, fine-tuned on the CIFAR-100 dataset for image classification tasks
Image Classification Transformers
MazenAmria
37
0
Efficientformer L1 300
Apache-2.0
EfficientFormer-L1 is a vision Transformer model developed by Snap Research, optimized for mobile devices to achieve extremely low latency while maintaining high performance.
Image Classification English
snap-research
513
3
Swin Tiny Finetuned Cifar100
Apache-2.0
An image classification model based on the Swin Transformer Tiny architecture, fine-tuned on the CIFAR-100 dataset.
Image Classification Transformers
MazenAmria
63
1
Vit Base Patch16 224 In21k Finetuned Cifar10 Test
Apache-2.0
A test version of Google's Vision Transformer (ViT) base model, fine-tuned on the CIFAR-10 dataset.
Image Classification Transformers
minhhoque
30
0
Vit Hybrid Base Bit 384
Apache-2.0
The Hybrid Vision Transformer (ViT) model combines convolutional networks and Transformer architectures for image classification tasks, excelling on ImageNet.
Image Classification Transformers
google
992.28k
6
Dataset Model
Apache-2.0
An image classification model based on ViT architecture, fine-tuned on an image folder dataset
Image Classification Transformers
Farideh
30
0
3d Printed Or Not
MIT
This is an image classification model used to determine whether an image is of a 3D-printed object.
Image Classification English
cmudrc
39
2
Vit Base Patch16 224 In21k Finetuned Cifar10 Album Vitvmmrdb Make Model Album Pred
Apache-2.0
A Vision Transformer (ViT) based model fine-tuned on the CIFAR-10 dataset for image classification tasks
Image Classification Transformers
venetis
30
0
Image Spam Detection Keras2
A Keras-based model with unspecified functionality, possibly used for image classification or spam detection tasks.
Text Classification
fbadine
31
0
© 2025 AIbase